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1. INTRODUCTION 

Wireless sensor networks (WSNs) have gained great importance in the last few years in various 
monitoring and control applications such as military applications, smart building, health monitoring [1], 
recognition, environment control and precision farming [2]. This interest is expected to grow further with the 
high evolution of wireless systems. An increasing number of applications have been related to distributed 
monitoring by exploiting real-time position estimation. 

Localization [3], [4] has became a very important need in these applications. Accordingly, 
researchers continue seeking efficient solutions for localization and tracking services which should be able to 
make a tradeoff between robustness, precision and real time availability. Global positioning system (GPS) is 
popular method of the localization service; however, its employment in indoor environments is not possible. 
Therefore, researchers seek for alternative solutions based on WSNs. In fact, many strategies using different 
metrics like received signal strength indicator (RSSI) [5]-[7], angle of arrival (AOA) [8], time of arrival 
(TOA) [9] and impulse responses (IR) [10], have been suggested. Localization using RSSI has been proposed 
as this radio indicator is always available and easily measured, while the other existing techniques often 
require expensive and complex hardware and present many constraints such as the high cost and the memory 
and energy limitation. On the other side, indoor environments present many constraints like interference, 
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reflections, shadowing due to obstacles and pathloss fading. Consequently, RSSI is submitted to high level of 
fluctuations that will cause negative effects on the accuracy of the localization results. Particularly, the main 
challenge is to reduce this perturbation in dynamic scenario. So, it has been suggested to develop learning by 
example (LBE) solutions which proved their suitability in the understanding of the complex relation between 
the RSSI behavior and the target position. Many researches propose Machine learning based methods for 
localization systems such as support vector machines (SVMs) [11], neural networks (NNs), and Fuzzy Logic 
[12]. Accordingly, with machine learning methods, it is simpler to model the complex and dynamic behavior 
of RSSI in WSNs. Also, predicting mobile positions solutions using filtering have been increasingly 
employed [13]-[15]. 

The contributions of this paper are listed as follows: First contribution is emphasized on the 
framework of building the fingerprint database by means of an adequate propagation model applied to the 
multiple distributed training samples in order to address RSSI fluctuations problem. Second contribution 
proposes a tracking strategy of a moving node that combines regression tree and Kalman Smoother to have 
better performance. It uses a decision tree as a predictive model which maps the observations of training 
RSSI to the training positions. The estimated location by means of the RT is introduced to the Kalman 
smoother algorithm as the observation information, to provide more accurate state of the target. The proposed 
algorithm has been experimentally assessed using real measurement of a mobile target in an office room. The 
first section introduces the problematic. Then, Sect. 2 is devoted to explain the proposed method. In this 
section, the RT based localization algorithm using RSSI parameters is presented. Finally, the last section is 
devoted to analyze the experimental results. 

The development of smart communication technologies has enabled the emergence of new Location 
based services (LBSs) applications [16]-[18]. Indeed, indoor localization is a basic process in robotics field 
[19], [20]. Simultaneous localization and mapping (SLAM) is proposed [21], enabling localization and 
building maps of an unknown environment. The authors combine SLAM with extended Kalman filter (EKF) 
to overcome the problem of having noisy acquired data. The simulation validation shows accurate and 
consistent positioning particularly when the landmarks number is increased. However, in the context of 
mobile autonomous robots which are limited in terms of energy, the SLAM method requires higher 
computational cost and its complexity should be studied specially when dealing with large maps. 

Geolocalization is well used to develop efficient routing algorithm [22]. Developing routing method 
for emergency healthcare applications is a challenging topic. Singh et al. [23] proposed an efficient routing 
algorithm based on geolocation (W-GeoR) for VANET’s health monitoring applications. The W-GeoR 
method is emphasized on selecting the best next-hop node to determine the optimal path. 

The advancement of micro-electro-mechanical systems (MEMS) has enabled the growth of the 
unmanned aerial vehicle (UAV). Mostafa et al. [24], design an agent-based autonomous flight control (AFC) 
architecture for UAV, enabling autonomous navigation and routing. Among the proposed functionalities, 
localization is suggested to perform the tracking of UAV. The evaluation results confirm the performance of 
the proposed architecture in terms of precision, quality and power consumption. In similar context, 
autonomous vehicles have seen tremendous growth in recent years [25]. Localization and motion detection 
are basic parts of autonomous vehicle. Several recent studies [26]-[28] have suggested solutions to determine 
the path tracking of the autonomous vehicle. Vivacqua et al. [29] have proposed a low-cost architecture in 
terms of required equipments and algorithm complexity. It uses filtering method to reduce data uncertainty 
and improve the solution robustness. 

Many researchers have studied range-based localization using specific metric to enhance 
localization accuracy. For instance, Thimmaiah and Mahadevan [30] have proposed a TOA based 
localization method in wireless sensor networks. It has been proved that this method outperforms other 
existing approaches in terms of localization error tolerance. While ToA based localization systems provide 
accurate results, it is essential to have synchronized clocks. Synchronization has a negative impact on the 
localization system complexity. 


2. RESEARCH METHOD 
2.1. Problem formulation 

In this section, we descibe the experimental environment where the performance of the localization 
algorithm will be evaluated. We consider the indoor environment with size X = 9 [m] and Y = 7.1 [m]. As 
shown in Figure 1. A set of L reference nodes (anchors) are available at fixed positions. As we consider the 
dynamic scenario, the moving target will acquire the RSSI data from anchors. The collected RSSI data will 
be provided at particular positions. The IEEE802.15.4 standard working at the operating frequency f=2.4 
[GHz] has been used in this experiment. 
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@ Test node 
¢ Anchor node 


Figure 1. Indoor environment 


The position state evolution over time is depicted in Figure 2. As shown, each current position 
estimation X,,, is based on the previous estimation X, by predicting each time the covariance error denoted 
P. The updating equations yield to more refined position estimation Y at each time instant. 


WY ra. Xiz) 
Figure 2. Target tracking problem 


We denote unknown state vector at time t as x(t), including the position of the moving sensor. The 
adopted mobility [31] equation is given by (1): 


x(t) = Fx(t-1)+W (1) 


with F is the state transition matrix and W = N (0, Q) is the the noise term given by a Gaussian distribution 
with zero mean and covariance Q. The observation equation including the measurement is given by (2): 


z(t) = Hx(t)+V (2) 
with H is the observation matrix and V = N(0,R) is the observation noise with covariance denoted R. 
2.2. Proposed tracking algorithm 
2.2.1. Channel modeling 

To characterize the indoor environment propagation, it has been suggested to refer to the pathloss 


log-normal model [32] for its efficiency and simple implementation. Accordingly, the received power can be 
defined as (3), 


d 
Pre = Pix + Ke — 109 log () +X, (3) 
0 
where P,, is the transmitted power, 7 is the pathloss coefficient which depends on the environment 


characteristic, dg is the reference distance, d is the transmitter-receiver distance and K, is a propagation 
constant given by (4). 


K, = 20 log (=) (4) 
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The noise X, is modeled by a Gaussian random variable with zero-mean and variance o. Let us 
consider an optimized channel modeling by selecting an optimal pathloss exponent 7 by focusing on 
minimizing the minimum mean square error (MMSE) given by (5): 


min[|Frx(7) — RSSI,|?] (5) 


where RSSI,., is the RSSI measured by one of the selected anchors. As discussed in [17], the optimized 
channel model is emphasized on the changing behavior of the pathloss exponent according to the distance. 
Thus, the new model focuses on two different pathloss exponents ņı and yzas described in the following 
equation. 


Pix + Ke — 10n; log (=)ifa < 20A 

Pry = is 

Pix + Ke — 10n; log (=) otherwise 
0 


2.2.2. Observation model: RT-based localization 

This section aims at introducing the RT based algorithm using RSSI [33]. The algorithm starts with 
defining M training samples presented in uniform distribution as indicated in Figure 3, where 6 is the interval 
between two samples. As shown in Figure 4, the RSSI transmitted by the anchor nodes and collected by the 
moving target represent the new input data to RT method. The predicted result will be considered as the 
observed data z(t). The proposed model is presented in the following sub-section. 
a) Definition of the training data: Machine learning methods are emphasized on two main steps, the training 


step and the testing or prediction step. The input information P = (eee is the received signal 


strength at samples m = 1,..., M and transmitted by reference node l. The location of each training sample 
is denoted as fn = (Xm) Ym). 
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Figure 3. Training data 
b) Regression tree algorithm using RSSI: This section describes the RT method. As presented in Figure 4. 
As shown in Figure 4, the algorithm starts by the training phase, where the defined training samples with 
their corresponding locations tm = ({Xm, Ym}%-1) are the input training data. The main goal of the 
training process is to construct the model which maps the RSSI data to the locations. This method is 


based on minimizing the mean square error € given by the following expression where 7%, is the actual 
training position and F is the estimated position during the training. 


1 uM z 

O (tin — #)? (7) 
Xm=1 Tin (8) 
Then, the algorithm focuses on splitting the training set and minimizing the error resulting from the 
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Figure 4. RT-based localization method 


c) Kalman smoother: As previously mentioned, the KS is introduced to have more refined position 
estimation. One of the common methods used in KS is the Rauch-Tung-Striebel (RTS) smoother 
algorithm. RTS is an efficient two-pass algorithm operating in two stages: the forward and the backward 
steps. The first one is defined as the regular Kalman filter (KF) using recursive model for state estimation. 
The different steps of the KS method are illustrated in Figure 5. The goal of prediction phase of KS is to 
estimate the future position based on the current location as given by (10). 


x(t/t-—1) = Fx(t-—1)+W (10) 
The covariance is given by (11): 
P(t/t —1) = FP(t—1)FT+Q (11) 


these two terms are refined during the correction phase by means of the observed data already predicted by 
the RT method. So, the Kalman gain is calculated as (12): 


G(t) = P(t/t — 1) HT (HP(t/t —-1) HT +R)-1 (12) 
the updated equations are given by the following expressions. 

x(t/t) = x(t/t — 1) + (z(t) — x(t /t — 1)H)G(t) (13) 

P(t/t — 1) = (I — G(t)H)P(t /t — 1)) (14) 

The resulting a-priori and a-posteriori state estimates, and the covariance estimates are saved in the 
backward step to be used in the smoothing phase. The optimal smoothed estimate of the state is obtained 
using the observations. In more detail, during the backward step, the state estimate of the forward computed 
before is updated in order to provide an improved smoothed estimate. The recursive equations are defined as 
(15): 

x(t/n) = x(t/t) — K,(x(t + 1/n) — x(t + 1/t)) (15) 
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P(t/n) = P(t/t) + K,(P(t + 1/n) — P(t + 1/n)) Kf (16) 
where K, is the smoother gain calculated as, 


K, = P(t/t)F7(t + 1)P-4(t + 1/t) (17) 


the iteration of KS filtering and RTSS is done between adjacent instants in order to ensure the real-time need. 
In more details, the filtering estimation at time t is used to smooth the estimated state and covariance at time 
t-1. This yield to more refined result at time k-1. 


Initial Prediction Phase 
conditions 


-Time update 
-Estimate state and its 
covariance 


Correction Phase 


-Compute Kalman gain 
-Correction of the state 
and its covariance 


Smoothing Phase 


-Compute the smoothed 
state and its covariance 


Figure 5. Block diagram of Kalman smoother algorithm 


3. RESULTS AND DISCUSSION 

The proposed algorithm is assessed using real measurements gathered in an indoor environment 
with size X = 10 m and Y = 8m, where 8 reference nodes are deployed at fixed positions. The WSN uses 
the frequency 2.4 GHz. The experimental environment is depicted in Figure 1. For the training data, we 
consider M=285 training samples uniformly distributed and generated by the pathloss model. The basic 
metric to evaluate the adopted solution is the relative average localization error p defined as (18): 


1 n 
pa z( Yaa] x 100 (18) 


where €,, is the absolute error corresponding to position with index n. 

First, we start looking at the absolute localization error for several test positions. Table 1 shows the 
predicted positions for the mobile sensor when using KS and KF. The estimation is evaluated for K=9 
acquisitions at different time instant. As it can be seen, more accurate position estimation when using KS 
compared with KF method. For example, the minimum error is €g = 0.1m at t = 8 while the maximum 
value is €, = 1.08 m. 


Table 1. Absolute error of KS and KF for different time instant 
time instantt t=1 t=2 t=3 t=4 t=5 t=6 t=7 t=8 t=9 
[m] for KF 0.46 0.61 1.04 0.91 1.10 1.30 0.95 0.11 0.75 
e [m] for KS 0.34 0.87 0.35 0.30 0.92 1.08 0.44 0.10 0.75 


To have fair comparison with the existing state of the art methods, we assess the algorithms in terms 
of relative localization error and variance. Figure 6 and Figure 7 show the comparison between the proposed 
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algorithm, the RT and using KF by focusing on the two metrics. As clearly shown, we obtain lower error and 
variance with the suggested method. The combination of RT with KS presents an error 5.8%, while it's 
10.6% and 12.5% for RT combined to KF and RT respectively. Similarly, the variance of the proposed 
algorithm is 0.12 m which is a low value. This proves that our method provides high accuracy for all the 
estimated positions despite the rapid changes and incertainty that characterize the environement. 
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Figure 6. Localization error for the proposed algorithm versus RT, RT+KF 
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Figure 7. Variance for the proposed algorithm versus RT, RT+KF 


Now, we evaluate the impact of the training samples number considerd during the training phase. 
Consequently, we will vary the distance ò that separates two training samples as the uniform distribution is 
considered in this work. Figure 8 and Figure 9 report the average absolute error £ and its variance in function 
of the training input data by comparing the proposed algorithm to the existing state of the art strategies: SVR, 
Naive Bayes, and the trilateration. 

In Figures 8 and 9, the suggested method outperforms the other algorithms in terms of error and 
variance when changing the training samples number. Increasing the number of training samples does not 
improve the precision. This is due to fluctuations and incertainty of RSSI data. Accordingly, selecting few 
numbers M=285 training samples will suffice to have optimal results in terms of accuracy and complexity. 
The comparison among the proposed method and the machine-based localization algorithms: RT, SVR and 
NB is depicted in Figure 10. As it can be noticed, KS with RT improves the accuracy and outperforms the 
well known machine learning methods when considering the relative localization error. 
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Figure 10. Localization error for the proposed algorithm versus RT, SVR, and NB 
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4. CONCLUSION 

In this paper, we address the wireless sensor tracking in an indoor area. The work focuses on using 
RSSI parameter to reach the accuracy objective. Moreover, this metric is low cost since it'a already available 
in all wireless equipments. However, this acquired data is submitted to many environmental constraints like 
fluctuations and incertainty. Consequently, we propose to use machine learning algorithm to overcome the 
issue of the environment complexity. During the training phase, the algorithm starts the data collection by 
distributing the training samples in the area and determining the corresponding RSSI using the pathloss 
model. During the testing phase, real RSSI measurements of a mobile sensor are considered as the input of 
the trained model. Consequently, the positions are estimated using the regression tree model already built 
during the training step. As our major concern is accurate positionning, the regression tree is combined with 
the Kalman Smoother method yielding to more refined localization estimation. The suggested method has 
been evaluated using real RSSI measurements and compared to existing learning by example algorithms. The 
results show the suggested approach outperforms other methods, giving low localization error. 

As mentioned before, the RSSI presents low-cost metric, while this radio parameter is too sensitive 
particularly when considering dynamic scenario in an indoor environment. Accordingly, efficient techniques 
of filtering would be studied and discussed to reduce the real RSSI measurements fluctuations together with 
the improvement of the propagtion model. Moreover, real time availability is one of the localization goals 
that would improve. The delay of the position estimation would be reduced particularly when dealing with 
emergency applications. 

Future work intends to improve the suggested approach by considering multi moving targets. Also, 
the method will be adapted to a more complex indoor environment with obstacles. On the other side, the 
sensor node is limited in terms of battery. Reducing the computational cost of the localization method is a 
major concern. Thus, the next challenge is to reduce the algorithm complexity. 
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